Bias Reduction in Instrumental Variable Estimation through First-Stage Shrinkage
نویسنده
چکیده
The two-stage least-squares (2SLS) estimator is known to be biased when its first-stage fit is poor. I show that better first-stage prediction can alleviate this bias. In a two-stage linear regression model with Normal noise, I consider shrinkage in the estimation of the first-stage instrumental variable coefficients. For at least four instrumental variables and a single endogenous regressor, I establish that the standard 2SLS estimator is dominated with respect to bias. The dominating IV estimator applies James–Stein type shrinkage in a first-stage highdimensional Normal-means problem followed by a control-function approach in the second stage. It preserves invariances of the structural instrumental variable equations. Introduction The standard two-stage least-squares (2SLS) estimator is known to be biased towards the OLS estimator when instruments are many or weak. In a linear instrumental variables model with one endogenous regressor, at least four instruments, and Normal noise, I propose an estimator that combines James– Stein shrinkage in a first stage with a second-stage control-function approach. Jann Spiess, Department of Economics, Harvard University, [email protected]. I thank Gary Chamberlain, Maximilian Kasy, and Jim Stock for helpful comments. 1 Unlike other IV estimators based on James–Stein shrinkage, my estimator reduces bias uniformly relative to 2SLS. Unlike LIML, it is invariant with respect to the structural form and translation of the target parameter. I consider the first stage of a two-stage least-squares estimator as a highdimensional prediction problem, to which I apply rotation-invariant shrinkage akin to James and Stein (1961). Regressing the outcome on the resulting predicted values of the endogenous regressor directly would shrink the 2SLS estimator towards zero, which could increase or decrease bias depending on the true value of the target parameter. Conversely, shrinking the 2SLS estimator towards the OLS estimator can reduce risk (Hansen, 2017), but increases bias towards OLS. Instead, my proposed estimator uses the first-stage residuals as controls in the second-stage regression of the outcome on the endogenous regressor. If no shrinkage is applied, the 2SLS estimator is obtained as a special case, while a variant of James and Stein (1961) shrinkage that never fully shrinks to zero uniformly reduces bias. The proposed estimator is invariant to a group of transformations that include translation in the target parameter. While the limited-information maximum likelihood estimator (LIML) can be motivated rigorously as an invariant Bayes solution to a decision problem (Chamberlain, 2007), these transformations rotate the (appropriately re-parametrized) target parameter and invariance applies to a loss function that has a non-standard form in the original parametrization. In particular, unlike LIML, the invariance of my estimator applies to squared-error loss. The two-stage linear model is set up in Section 1. Section 2 proposes the estimator and establishes bias improvement relative to 2SLS. Section 3 develops invariance properties of the proposed estimator.
منابع مشابه
Applications of James–Stein Shrinkage (II): Bias Reduction in Instrumental Variable Estimation
In a two-stage linear regression model with Normal noise, I consider James–Stein type shrinkage in the estimation of the first-stage instrumental variable coefficients. For at least four instrumental variables and a single endogenous regressor, I show that the standard two-stage least-squares estimator is dominated with respect to bias. I construct the dominating estimator by a variant of James...
متن کاملInstrumental Variable Estimation and Selection with Many Weak and Irrelevant Instruments
This paper proposes a new two stage least squares (2SLS) estimator which is consistent and asymptotically normal in the presence of many weak and irrelevant instruments and heteroskedasticity. In the first stage the estimator uses an adaptive absolute shrinkage and selection operator (LASSO) that selects the relevant instruments with high probability. However, the adaptive LASSO estimates have ...
متن کاملShrinkage methods for instrumental variable estimation∗
This paper proposes shrinkage methods for instrumental variable estimation to solve the “many instruments” problem. Even though using a large number of instruments reduces the asymptotic variances of the estimators, it has been observed both in theoretical works and in practice that in finite samples the estimators may behave very poorly if the number of instruments is large. This problem can b...
متن کاملTwo-stage instrumental variable methods for estimating the causal odds ratio: analysis of bias.
We present closed-form expressions of asymptotic bias for the causal odds ratio from two estimation approaches of instrumental variable logistic regression: (i) the two-stage predictor substitution (2SPS) method and (ii) the two-stage residual inclusion (2SRI) approach. Under the 2SPS approach, the first stage model yields the predicted value of treatment as a function of an instrument and cova...
متن کاملEmpirical Bayes estimation of the selected treatment mean for two-stage drop-the-loser trials: a meta-analytic approach
Point estimation for the selected treatment in a two-stage drop-the-loser trial is not straightforward because a substantial bias can be induced in the standard maximum likelihood estimate (MLE) through the first stage selection process. Research has generally focused on alternative estimation strategies that apply a bias correction to the MLE; however, such estimators can have a large mean squ...
متن کامل